Integrating Syntagmatic Information in a Dictionary for Computer Speech Applications
Author
Abstract
Conventional dictionaries, although they often comprise an impressive amount of paradigmatic information on various aspects of linguistic description, usually pay little attention to the representation of syntagmatic information. Admittedly, apart from spelling conventions and rules of inflectional agreement, the co-occurrence of individual lexical items will not normally change the orthographic shape of a word when it appears in written text. In spoken language, however, the phonetic realization of words is heavily influenced by context and may change dramatically in a variety of ways, affecting segmental as well as prosodic features. These changes need to be taken into account in both computer speech synthesis and automatic speech recognition. In this paper, therefore, we argue for the inclusion of syntagmatic information in dictionaries that are developed for the special purpose of spoken language processing in computer speech applications. Two kinds of syntagmatic information will be considered in more detail: Case Frames and Collocations.
Similar resources
Speech Enhancement using Adaptive Data-Based Dictionary Learning
In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...
A new Dictionary of Swedish Pronunciation
This paper describes some aspects of a pronunciation dictionary for Swedish, "Svenskt Uttalslexikon" (SUL), which is presently being developed at our department. This dictionary provides, among other items, three kinds of information about Swedish pronunciation that are not included in standard dictionaries: information on variants, on inflected forms and compounds, and on proper names. SUL is ...
Automatic Construction of Persian ICT WordNet using Princeton WordNet
WordNet is a large lexical database of the English language in which nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is mainly used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...
A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
The quality of a speech signal is significantly reduced in the presence of environmental noise, which degrades the performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, single-channel enhancement of speech signals corrupted by additive noise is considered. A dictionary-based algorithm is proposed to train the speech...
Concept-to-speech generation by integrating syntagmatic features into HMM-based speech synthesis
In conventional concept-to-speech (CTS) methods, a common step is predicting abstract prosodic descriptions, such as the locations of accents and phrase boundaries, from the linguistic information provided by the text generation module. But the prediction results always contain errors, and unacceptable prosodic prediction may ruin the synthesized speech. In addition, linguistic information, whi...